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Abstract 

By a random billiard we mean a billiard system in which the standard specular reflection rule 
is replaced with a Markov transition probabilities operator P that, at each collision of the billiard 
particle with the boundary of the billiard domain, gives the probability distribution of the post- 
collision velocity for a given pre-collision velocity. A random billiard with microstructure, or 
RBM for short, is a random billiard for which P is derived from a choice of geometric/mechanical 
structure on the boundary of the billiard domain. Such random billiards provide simple and 
explicit mechanical models of particle-surface interaction that can incorporate thermal effects 
and permit a detailed study of thermostatic action from the perspective of the standard theory 
of Markov chains on general state spaces. 

The main focus of the present paper is on the operator P itself and how it relates to the 
mechanical and geometric features of the microstructure, such as mass ratios, curvatures, and 
potentials. The main results are as follows: (1) we characterize the stationary probabilities 
(equilibrium states) of P and show how standard equilibrium distributions studied in classical 
statistical mechanics, such as the Maxwell-Boltzmann distribution and the Knudsen cosine law, 
arise naturally as generalized invariant billiard measures; (2) we obtain some basic functional 
theoretic properties of P. Under very general conditions, we show that P is a self-adjoint 
operator of norm 1 on an appropriate Hilbert space. In a simple but illustrative example, we 
show that P is a compact (Hilbert-Schmidt) operator. This leads to the issue of relating the 
spectrum of eigenvalues of P to the geometric/mechanical features of the billiard microstructure; 

(3) we explore the latter issue both analytically and numerically in a few representative examples; 

(4) we present a general algorithm for simulating these Markov chains based on a geometric 
description of the invariant volumes of classical statistical mechanics. Our description of these 
volumes may also have independent interest. 



1 Introduction 

This extended introduction contains some definitions and an overview of the main results. Additional 
results and refinements are discussed throughout the text. 



1.1 Physical motivation 

Consider the idealized and somewhat fanciful billiard system shown in Figure [TJ At a "macroscopic 
scale" it consists of a point particle, henceforth called the molecule, and a billiard table having 
piecewise smooth wall (only a small part of which is shown). At a "microscopic scale," both the wall 
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and the molecule may reveal further geometric and mechanical structure that can affect the outcome 
of a collision. Thus collisions are not necessarily specular; to specify the outcome of a collision it 
is necessary to consider the interaction between molecule and wall at this finer scale. We suppose 
that the wall system is kept at a constant statistical state, say, a canonical ensemble distribution 
with a given temperature, and wish to follow the evolution of the statistical state of the molecule. 
The outcome of a molecule- wall collision event is then shown to be described by a time-independent 
transition probabilities operator P, to be defined later as an operator on an L 2 space over the set 
of pure states of the molecule. This operator, which as we will see is canonically defined by the 
mechanical/geometric features of the wall and molecule microstructure and the constant statistical 
state of the wall, replaces the mirror-reflection map of an ordinary billiard. We call a system of this 
kind a random billiard with microstructure. or RBM. 




wall system 



Figure 1: An arbitrary molecule- wall system defining a random billiard with microstructure. 

Discrete-time Markov chains associated to P arc interpreted as random states of the molecule 
immediately after each collision, starting from some initial probability distribution, at least for simple 
shapes of the the billiard domain such as cylinders or balls. Besides determining the equilibrium 
states of the molecule as the stationary (i.e., P-invariant) probability distributions for the Markov 
chain, the operator P contains information about rates of decay of correlations and spectral data, 
which can in principle be used to derive transport coefficients such as the diffusion constant of a gas 
of non-interacting molecules moving inside a "billiard channel." (See Figure [3J) 

Here we develop some of these ideas in detail, focusing on the billiard-Markov operator P and 
its relationship with the microstructure. The main results of the paper are concerned with defining 
P for any given Newtonian molecule-wall system, deriving its basic functional analytic properties, 
describing stationary probability distributions, and illustrating with concrete examples some of the 
spectral properties of P. 

There are several sources of motivations for this work, some purely mathematical and others more 
applied. On the purely mathematical side, we seek to have interesting and well-motivated classes of 
Markov chains that can be used to investigate issues of general interest in probability theory, such 
as spectral gap, mixing times, central limit theorems, etc. The statistical mechanics perspective 
in combination with very simple mechanical systems provides a great variety of examples. We also 
believe that generalized billiard systems of the kind we are considering may provide fruitful examples 
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of random (often hyperbolic) dynamical systems with singularities, i.e., random counterparts of the 
widely studied chaotic billiards, for which [3] is a recommended reference. On the more applied side, 
processes of the kind we are studying here may be useful in kinetic theory of gases as suggested, 
for example, in [7] in the context of Knudsen diffusion studies. In the context of the theory of 
Boltzmann equations, operators such as our P may serve to specify boundary conditions for gas- 
wall systems. (See, for example, [2] for the context in the theory of Boltzmann equations in which 
related operators, but not derived from any explicit microscopic interaction model like ours, arise.) 
Our random collision operators provide very natural and simple Newtonian models for the interaction 
of a molecule with a heat bath that can be used to study thermostatic action fairly explicitly and 
often analytically from the perspective of the general theory of Markov chains. 

1.2 The surface-scattering set-up 

The main definitions pertaining to the billiard microstructure are as follows. Let M be a smooth 
manifold whose points represent the configurations of a mechanical system. The system consists of 
two interacting subsystems: the wall and the molecule. A motion in M describes a molecule-wall 
collision event in which the molecule comes close to the the wall surface, scatters off of it, and moves 
away. Let smooth Riemannian manifolds M wall and M mol be the configuration spaces of the wall and 
the molecule subsystems; to capture the idea that there is a direction towards which the (center of 
mass of) the molecule approaches the plane of the wall, and that the microstructure on the wall is 
periodic, we assume that M mol factors as a Riemannian product 

M mol = M mol xRxT fe , 

where M mol is the manifold of molecular configurations under the assumption that the center of 
mass is at a fixed position. For the examples of interest, k < 2. 

In the example of Figure [IJ M wall is simply an interval [0,1], representing the range of positions 
of the wall-bound mass attached to the spring. The Riemannian metric on M mU is derived from 
the kinetic energy of the wall-bound mass. The manifold M mol may be written as SO(2) x [0,/i], 
specifying the spatial orientation (or angle of rotation) of the hollowed little disc and the position 
of the vibrating mass in its interior. The Riemannian metric is, again, derived from the kinetic 
energy of the molecule system, so metric coefficients are given by the values of masses and moments 
of inertia. The plane of the wall is aligned with the factor T 1 and the direction of approach of the 
molecule is the factor ]R in the product. We disregard the possible ("macroscopic") curvature of the 
billiard table boundary — the interaction is imagined to happen at a length scale in which boundary 
curvature cannot be discerned. 

Back to the general case, the combined wall-molecule system is represented by M with a Rieman- 
nian metric and a potential function U : M -*■ R such that (1) the two subsystems are non- interacting 
when they are sufficiently far apart (more details below) and, (2) for each value £ of the total en- 
ergy E, where E{q,v) = \ \v\ 2 + U{q) for (q,v) e TM, the subset of the level set i? _1 (£) consisting 
of states at which the subsystems are a bounded distance from each other has finite volume with 
respect to the invariant volume form S} E , whose definition is recalled later. 

To explain assumption (1), we assume the existence of a smooth function d : M -*■ R, interpreted 
as the distance in Euclidian space from the center of mass of the molecule to some reference position 
on the wall. For each real number a define M mol (a) := M mol x (a, oo) xT fe and M(a) = M mol (o)xM wall , 
and denote by 7r wall and 7f mol the projections onto M wall and M mo] , respectively. Then we suppose 
that there exists an ao e R, which can be taken with no loss of generality to be less than 0, such 
that, for all a > ao, 

i. the set {q e M : d(q) > a} is isometric to, and will be identified with, M(a); 
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ii. there are smooth functions U lnol : M mo 

U\m(o,) 



-* R and t/ wall : M waU 



R such that 



hi. for each value £ of the energy function E, the level set {v e T(M \ M(a)) : £7(u) = £} has 
finite volume relative to f2 B ; 

iv. the system is essentially dynamically complete, in the following sense: Any smooth curve 
1 >-»■ c(i) that satisfies Newton's equation (with acceleration defined in terms of the Levi-Civita 
connection) 

Vc'(f) 



dt 



-grad c(t) E7 



can be extended indefinitely in the interior of M, until it reaches the boundary; whenever c 
intersects the boundary transversely at a regular point q = c(t), it can be extended further back 
into the interior along the unique solution curve with initial state (q,w), where w = R q c'(t) 
and R q : TM -* TM is the standard reflection map. 

In the example of Figure^ A/ moI is the two-point set {-1, 1} labeling the two sheets of M above 
a certain distance from the handles. The manifold M wall consists of a single point and the potential 
function U is constant. We give M, say, the Riemannian metric induced from Euclidean 3-spacc. 
More representative examples, in which M walJ is non-trivial will be shown later. 



B" 




jl 



Figure 2: Geodesic motion on a periodic surface representing a molecule-wall scattering process. 



As already noted, the various manifolds above may have boundary. Boundary points represent 
collision configurations. It is necessary to accept manifolds whose boundaries may not be smooth. 
For concretcness, we adopt here the class of manifolds with corners (see [10j). which is general 
enough to provide plenty of meaningful examples. In particular, M contains a set d B M, the singular 
boundary, the complement of which is a smooth manifold with boundary in the ordinary sense of 
being modeled on open subsets of the upper half space. This complement is the union of the interior 
set M°, and the (regular) boundary d r M. Moreover, d a M is contained in the closure of d T M, it 
is nowhere dense in this closure and has measure in dM. Since we are mainly interested in 
probabilistic questions, it is typically safe to ignore the singular boundary set. 

If q is a regular boundary point of M and v is a unit vector perpendicular to the boundary at q, 
we assume that a motion in M is extended after hitting the boundary at q in such a way that the 
pre- and post-collision velocities v and v' are related according to the standard linear (reflection) 
map » h> »' := v - 2{v,v) q v; so "microscopic" collisions are specular. Being an isometry of the kinetic 
energy metric, this map leaves the energy function E invariant. 

Let S be the level set d = in M, i.e., 

S = M mol x {0} x T k x Af wall 
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and Ns the restriction of TM to S (more precisely, the pull-back of TM under the inclusion S ^ M). 
Informally, crossing S amounts to entering the zone of interaction M \ M(ao), though S itself lies 
in the product zone. The vectors in Ng pointing into the zone of interaction form the subset Ng. 
This is the set of incoming states. The set of outgoing states, Ng, is similarly defined as the set 
of vectors in Ns pointing out of the zone of interaction. Omitting, as we often do, the base point 
in M when referring to a state in Ng, then v h> —v sends an element of N§ to an element of Ng. 
Let H x R x T k = T(R x T k ) n Ng, where H is the half-space in R fe+1 . Then the incoming states 
decompose as a product 

jV+ = N mol x T fe x 7V wall 

where iV wall := TM wall and N mol := TM mol x H. We have chosen this particular decomposition so that 
the "observable" quantities of the molecule are grouped into the first factor and the quantities to be 
chosen probabilistically arc grouped into the second and third factors. 

A collision event is defined by an application of the map T : Ng -* Ng , which gives the return 
state from an initial state in Ng, obtained by integrating the equations of motion. Under our general 
assumptions this map is defined on almost all initial states by Poincare recurrence, and for many 
systems of interest it can be shown that T is smooth on a dense open set of full measure. We make 
this almost everywhere smoothness a standing assumption. For simplicity, we indicate the domain of 
T simply by Ng, ignoring the fact that it is really defined on an open dense subset of full measure. 
It is convenient to redefine T by composing it with the reflection map R : Ng Ng, so that T 
becomes a self- map of Ng. Thus we add to the above list of assumptions: 

v. the return map T is smooth on an open dense subset of Ng of full measure. 
1.3 The Markov operator 

Let r\ be any given probability measure on T fe x N mll . The physically most natural and interesting 
choice for r\ corresponds to taking the product of the uniform distribution on T fc and the Gibbs 
canonical distribution on TM wall with parameter f3 = 1/kT, whose definition is recalled later. The 
choice of measure fixes the statistical state of the wall system. The collection of possible states of 
the molecule system is the space CP(-/V mo i) of Borel probability measures on iV mol . We now define the 
map 

P:?(iv mol ) -+y(jv mol ) 

that associates to each statistical state \x e l 3'(N laol ) the new state fiP := (ir o T)*(/j, ® rj). Notations 
and general explanations are further provided in Section [21 The interpretation is that, to obtain the 
return statistical state of the molecule, we take its present state fj,, form the combined state n®r] of 
the system, let it evolve under T, thus yielding T*(/x <8> 77), and finally project the outcome back to 
iV mol under the natural projection 7r : Ng -> iV mol . The asterisk indicates the push-forward operation 
on measures. 

Consider again the system of Figure [2] as an example. In that case M waU is trivial (a single point) 
and jV mol is identified with {-1,1} x H, where H is the half-plane in IR 2 . It does not make sense 
in this case to consider a Gibbs canonical distribution — a natural measure 77 here is the uniform 
probability distribution on T . Since in this example the speed of the particle does not change, we 
consider not the full JV mol but a level set for the energy (say, only states with unit velocity). So we 
let AT mol = {-1,1} x (-7r/2, 7r/2), which parametrizes the sheet number (±1) and the angle 9 of the 
incoming trajectory relative to the normal to the wall plane. 

Writing s = (b,9) for a point in N mol , we can define P by first indicating how it acts on, say, 
essentially bounded functions on iV mol and then defining its action on J ) (7V mol ) by duality, (/xP)(/) = 
fi(Pf), where /i(/) indicates the integral of / with respect to fi. Thus if / is a bounded function on 
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N mol , and ^ s (x) is the state of return to N mol under T for (s,x) e N mol x T 1 , then from the general 
definition we have, 

(Pf)(s)= f f(* s (x))dx. 

When the (macroscopic) billiard table is a channel as shown in Figure |3l iterates of P give the 
post-collision states of a random flight of the molecule (say, in a gas of non-interacting molecules) 
inside the channel. 



Figure 3: A random flight in a channel. The molecule's state after each collision is specified by P and the 
pre-collision state. The stationary distribution fj, is interpreted as describing the state after the molecule has 
reached thermal equilibrium with the wall. 



Diffusion approximation of the random flight and the dependence of the diffusion constant on the 
spectrum of P are issues of particular interest, which will be investigated in another paper dedicated 
to central limit theorems for P and related topics. 

1.4 Overview of the main results 

The starting point of our analysis is a determination of the stationary probability measures of P. 
Recall that a probability measure fi is said to be stationary for P if fiP = fi. We consider two 
possibilities: (1) the space M wall reduces to a point, in which case the wall is regarded as a rigid, 
unmoving body that does not exchange energy with the billiard particle in a collision. Note that, in 
this case, the billiard chamber might still have a non-trivial structure, but it does not contain moving 
parts. In this case, the measure r\ that enters in the definition of P is taken to be the normalized 
Lebesgue measure on T k ; (2) the space M wall has dimension at least one. This means that the wall 
system has moving parts and energy can be transferred between wall and molecule in a collision. In 
this case we assume for the purposes of the next theorem that rj is the product of the normalized 
Lebesgue measure on T fc and the Gibbs canonical measure on the phase space iV waU with a fixed 
parameter j3. The latter can be written as follows (a fuller discussion of invariant measures is given 
in the last section of the paper): 

e -PE 

(1-1) <% = ^yKuAd£|, 

where E is the energy function on N„ aU , f2^f all is the invariant (Liouville) volume form on energy 
level sets in iV wall derived from the symplectic form on this space, and the vertical bars indicate the 
associated measure. The denominator is a normalization factor. 

We consider similar measures on 7V mol := TM mol x H. More precisely, in case (1) we fix a value £ 
of the energy function of the molecule, which remains constant throughout the process, and consider 
the microcanonical measure for this value, given by 

(1-2) ^ = pi ohS \ , 

where S is the hypersurface of separation between the product zone and the zone of interaction 
previously described, ^£ o1 5 is the invariant volume form on the part of the level set E = £ above S 
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and the denominator is a normalizing factor. In case (2) we define 



(1-3) d^ = ^-\^ s AdE\. 

A description of these measures better suited for applications will be given shortly. In the special 
case when M mol reduces to a point, so that N mol := H, these measures are as follows: in case (1) we 
may choose £ so that the molecule state lies in the unit hemisphere in H. Indicating by cj sphcro the 
standard volume form on the unit hemisphere and by v the unit normal vector to S, say, pointing 
into the zone of interaction, we have, up to a normalization constant C, 

dn(v) = C{v,v)\w* bm \, 

which we refer to as the Knudsen probability distribution; and in case (2) 

dfj,(v) = C{v,iy)e- /3l ^dV(v) 7 

where {v,v) and dV(v) are, respectively, the standard inner product and volume element in R fe+1 . 
This measure is the Maxwcll-Boltzmann distribution at boundary points. 

Theorem 1. Let P : 7{N mol ) -* 7(N mol ) be the Markov operator associated to a probability measure 
■q on T k x N wall . 

1. In case (1) above, let r\ be the normalized Lebesgue measure on T fc . Then the microcanonical 
distribution ] 1.2\ is a stationary probability for P. 

2. In case (2) above, let rj be the product of the normalized Lebesgue measure on T k and the 
Gibbs canonical distribution on N maU with temperature parameter [3. Then the Gibbs canonical 
distribution on N mol given bv \1.3l with the same parameter f3, is a stationary probability for P. 

In the particular case when the molecule reduces to a point, the stationary measures are, respectively, 
the Knudsen distribution in case (1) and the boundary Maxwell-Boltzmann distribution in case (2). 

The proof of this theorem is given at the end of Subsection 15.41 The stationary probability is 
often (but not always) unique and one often obtains convergence of /io-P" to the stationary state 
for any initial distribution [io s t J ) (N laol ). Thus the dynamic of a Markov chain derived from P 
describes the process of relaxation of the molecule's state toward thermal equilibrium with the wall. 
To understand this process in each particular situation it is necessary to study the operator P in 
more detail; we do this later in the text for a few concrete examples. 

The following definition is further elaborated in Section [2] We say that the molecule- wall system 
is symmetric if on the state space Nt, are defined two automorphisms J and S such that: 

i. these maps preserve the natural measure fig on Ng(£) derived from the symplectic structure; 

ii. they respect the product fibration it : Ng = N mol x T fe x N wb3S -* N mol and both induce the same 
map J on iV mol ; that is, 

7roJ=7ro5 = Jot:; 

iii. J is time reversing: J o T = T _1 o J; and S commutes with T: S o T = T o S. 

The existence of the map J is typically assured by the time reversibility of Newtonian mechanics 
and the symmetry S can often be obtained by a simple extension of the original system that does 
not affect its essential physical properties. (This is akin to defining an orientation double cover of 
a possibly non-oriented manifold.) The assumption of symmetry is thus a very weak one. These 
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points are further discussed in Section [5] and in some of the specific examples studied later in the 
paper. 

It is natural to consider the associated operator, still denoted P, on the Hilbcrt space L 2 (N mol , fi) , 
where \i is one of the stationary probabilities obtained in Theorem [T] We are particularly interested 
in the spectral theory of P. A first general observation in this direction is the following. 

Theorem 2. Let \i be the stationary measure of P obtained in Theorem [7] and suppose that the 
system is symmetric. Then P is a self-adjoint operator on L (N mo i, /j,) of norm 1. 

In particular, P has real spectrum contained in [-1,1]. It is often the case (this will be proved for 
a simple but representative example later in this paper, and has been shown for special classes of P in 
previous papers; see [HI IE]) that P is a compact, integral operator (Hilbert-Schmidt). The eigenvalues 
of P are then invariants of the system, depending in a canonical way on structural parameters like 
mass ratios, potential functions, curvatures, etc. The relationship between the spectrum of P and 
these parameters is one of the central issues in this subject. 

Of particular interest is the spectral gap of P, defined as f minus the spectral radius of the 
restriction of P to the orthogonal complement to the constant functions. As is well-known (see, for 
example, [13]) the spectral gap can be used to estimate the exponential rate of convergence of [ioP n 
to the stationary distribution in the total variation or the L 2 norm. 

A perturbation approach to the spectrum of P, which is valid when the molecule scattering is 
not far from specular, can be very fruitful. To make sense of this, first define 

£ 2 («) :=E v [dist(v,V) 2 ], 

where dist is a distance function on iV mol , v e N mol is an initial state, V is the random variable 
representing the scattered state after one collision event, and E v [-] is conditional expectation given 
the initial state v. We call £2 the second moment of scattering. Under the identification of Nt, and 
Ng (see above), specular reflection corresponds to V = v almost surely and small deviations from 
specularity correspond to small values of the second moment of scattering. We now defined the 
operator 

£ P :=2(P-J)/£ 2 , 

which we refer to as the random billiard Laplacian (or the Markov Laplacian) of the system. The 
billiard Laplacian, for small values of £2 often approximates a second order differential operator. 
In the examples studied, this will be seen to be a (densely defined) self-adjoint operator on the 
same Hilbert space on which P is defined, whose eigenvalue problem amounts to a standard Sturm- 
Liouville equation. 

I 

, ^ N 

I— . U+H. 

mi ■ m,2 



Figure 4: A simple example described in Theorem [3] 



In Section [3] we explore these ideas in detail with an example. The example consists of two 
point masses (see Figure U]) constrained to move along the half-line [0, 00). Mass mi, with position 
coordinate x±, is restricted to move in the interval < X\ < I and m 2 , with position coordinate x 2 , 
can move freely on x\ < X2 < °°. The two masses collide elastically, and mi collides clastically with 
walls at and I. The wall at I is regarded as permeable to 7712 but not to mi. The random state r\ 
of mi is taken to be the product of the uniform distribution over [0, 1] and a Gaussian probability 
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with mean and variance a 2 for its velocity. Mass m-i is the molecule and mass ra\ is part of 
the wall system. We refer to this as the two-masses system. To simplify notation and for other 
conveniences we rescale positions and velocities according to x ■- \fm\fmx\ and y = v/ m-i JmX2, 
where m = mi + 777,2. The main structural parameter of the system is the mass-ratio 7 := v/ 7772/7771. 
We let P 7 represent the Markov operator with mass-ratio 7. Further details are explained in Section 
[3j We summarize in the next theorem some of the main conclusions obtained for the two-masses 
example. (Further refinements and numerical calculations are described in that section.) 

Theorem 3 (Case study). The following assertions hold for the two-masses system with 7 < l/\/3: 

1. P 7 has a unique stationary distribution p. Its density relative to Lebesgue measure on (0,oo) 
is given by 

p(v) = CT -1 vexp 

2. For an arbitrary initial probability distribution po, we have [|/xo-P™ - p\\tv -*■ exponentially 
fast in the total variation norm. 

3. P 7 is a Hilbert- Schmidt operator. 

4- If ip is a function of class C 3 on (0, oo), then the billiard Laplacian has the following limit 

(L(p)(v) = hm — '- — :=[--v\<p(v)+<p (v). 

7^0 27^ \v ) 

Equivalently, L can be written in Sturm- Liouville form as Lip = P~ X 4~ (p^) i which is a densely 
defined self-adjoint operator on L 2 ((0, 00), /j). 

Based on part 4 of the above theorem and a simple analysis of the corresponding Sturm-Liouville 
eigenvalue problem (the equation in part 4 is, after the change of coordinates x = v 2 /2, Laguerre's 
equation), we can make an educated guess as to the asymptotic value, for small 7, of the spectral 
gap of P 7 : it is given by 47 2 . Although we do not prove this here, we offer in Section [3] numerical 
evidence for its validity. This gives the following refinement of item 2 of Theorem [3] 

|| / ^oP y n -/^||TV<C(l-47 2 )" 

C is a positive constant and 7/7 -*■ 1 as the mass-ratio parameter 7 approaches 0. (A general spectral 
perturbation study of our Markov operators with small moment of scattering based on comparison 
with Sturm-Liouville eigenvalue equations will be given in another paper.) 

Theorem [3] is proved in Section [3J Section U discusses further examples in less detail. 

1.5 An algorithm for the Markov chain simulation 

The map T generating the deterministic discrete dynamical system involved in the definition of P 
is essentially a kind of billiard map |3j in a very general setting. For applications of Theorem [TJ 
particularly in numerical simulations of the Markov chains, we need a convenient expression for 
the corresponding invariant billiard measure and for the Gibbs distributions. We recall that the 
standard invariant billiard measure for planar billiards can be described as follows: If states of the 
billiard system are represented by coordinates (s,#), where s € [0,1] is proportional to arclength 
measured from a reference point on the boundary of the billiard tabic (assuming that this boundary 
has finite length) and 6 e [-ir/2,%/2] measures the angle that a unit velocity vector based at the 
point represented by s makes with the inward pointing normal vector, then dfi(s,9) = ^ cos 9 dsdO 
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is the canonical invariant probability measure on the two-dimensional state space of the billiard 
system. See, for example, [3]. We wish to have a similar description of the invariant billiard measure 
on boundary components of general Ricmannian manifolds in the presence of potential functions. 
Although this is something rather classical and possibly relatively well-known, we could not find 
it in the literature in a form that is convenient for our needs. It may thus be of some interest to 
highlight such an expression here. After doing this, we give at the end of this subsection the outline 
of an algorithm for generating Markov chains associated to general microstructures. 

The next theorem is of some independent interest having to do with a representation of the 
standard volume measures of classical statistical mechanics. Our primary interest is to apply the 
theorem to a neighborhood of S in the non-interaction zone of M described above. But, in the 
interest of generality, the notation in this section is similar to, but independent of what was used 
above. In particular, we will reuse M to mean any smooth Ricmannian manifold with corners and 
U •" M -*■ M any smooth potential function. 

As always, we let the kinetic energy be defined by the Ricmannian metric, k(v) = |||f|| 2 , and 
consider the Hamiltonian flow on N = TM with standard energy function E := k + U o r, where 
r : N -*■ M is the base-point projection. Let m be the dimension of M and S an (m— l)-dimcnsional 
submanifold of the boundary of M. Let JVg(£) be the intersection of Ng, defined just as in the 
more specialized setting of the molecule-wall systems, with the constant energy submanifold E = £ 
of N. We assume that the return map T : 7VJ(£) -> Ng(E) is defined (in the a.e. sense described 
above); let v be the unit normal vector field on S pointing towards the interior of M; and for any 
given value £ define h E ■ M(£) -» R, where M(£) := {q e M : U(q) < £} and 



(1.4) he(q) ■■= V2(£ -£/(</))■ 

Extend h E to all of M by setting h E = on the complement of M(£). Now let W be an open set in 
M(£) on which is defined an orthonormal frame of vector fields e\, . . . , e m ; let Nw(&) be the part 
of JV(£) above W, and let F E ■ W x S" 1 ' 1 N W (E) be the map such that 

Fs.{q,u) = \ q,hz(q)^Uiei{q)\, 

where S"™ _1 is the unit sphere in R m . Then F E is a diffeomorphism and u F E (q,u) maps the 
unit sphere bijectively onto the fiber of A(£) above q. If W intersects S or more generally dM, we 
assume that at any q e W n dM the vector e m (q) is perpendicular to T q (dM). 

The Ricmannian volume form on M will be denoted by ui M and that on S by lo s . Recall that 
the relationship between u> M and uj s is that w s = v J oj m , the interior multiplication of ui M by the 
unit normal vector vector field on S. Let w s P horo be the Euclidean volume form on S"™ _1 , which 
is obtained from the standard volume form on R m by interior multiplication with the unit radial 
vector field. In Section [5] we define the (microcanonical) invariant forms Q, B and fig on ^V(£) and 
A^(£), respectively, in terms of the symplectic form and prove the following. 

Theorem 4 (Invariant volumes). For any choice of orthonormal frame over an open set W c M(£), 
and given the frame map F E : W x S"™ _1 -*■ N\y(£) defined above, the form D, E satisfies 

T?*r\E , _7 m-2 M . sphere 

r E il = ±mh E lu Aw 

If W is a neighborhood of a point in S, we similarly have 

F|nf = ± cos 6 hl 1 - 1 ^ a oj sphcr % 

where 0(v) is the angle that v e T q M makes with v(q) for q e S . Apart from the unspecified signs, 
these expressions do not depend on the choice of local orthonormal frame. 
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The first volume form is invariant under the Hamiltonian flow, and the second is invariant under 
the return map to S. We refer to the latter as the billiard volume form and to the former as the 
Liouville volume form. The Gibbs canonical distribution with temperature parameter (3 is then the 
probability measure obtained from the volume form 

-0Eim-2lTJ . M , sphere 

e /i £ ah/ auj a uj 

on N; similarly the Gibbs volume form is defined on Ng using the billiard volume form just intro- 
duced above. These volumes on N and on Ng are also invariant under the Hamiltonian flow and 
under T, respectively. The probability measure on iV|(£) associated to the billiard volume form 
may also be called the Gibbs microcanonical distribution. Note how the probability of occupying a 
state of high potential energy in the microcanonical distribution depends on the potential function 
(thus on the position in M) due to the term h™^ 1 . 

Given the representation of the invariant volumes of Theorem 21 we can state the Markov chain 
algorithm as follows. 

MCI. Start with a £ old e N mol , representing the state of the molecule prior to a collision event; 

MC2. Choose £ with probability density proportional to exp(-/3£), representing the energy of the 
wall system; 

MC3. Choose q e M wall (£) = {q e M wall : U mU (q) < £} with probability density proportional to 
h™~ 2 relative to the Riemannian volume (which defines uniform distribution), where m is the 
dimension of M wall ; 

MC4. Choose a random vector u over the unit sphere in T g M wall with the uniform distribution; 
MC5. Set the state of the wall prior to the collision event to (q, h^(q)u); 

MC6. Use the combined state (£, old ,q,he.(q)u) as the initial condition of the molecule-wall system 
prior to collision and let it evolve according to the deterministic equations of motion until the 
molecule leaves the zone of interaction; record the state £ now of the molecule at this moment. 

This procedure is illustrated in Subsection l4.2l with an example that is similar to that of Sectional] 
but now involving a non-constant potential function. Similarly interpreting Theorem [1] a (typically 
unique) stationary distribution for a Markov chain with transitions £ old >-»■ £ ncw can be sampled from 
in the following way: 

SD1. Choose £ with probability density proportional to exp(-/3£); 

SD2. Choose q in M mol x T fc with probability density proportional to h 1 ?' 1 , where m is now the 
dimension of the latter manifold; 

SD3. Choose a random vector u over the unit sphere in T q M mol with probability density proportional 
to cos 9, where 9 is the angle between the velocity of the molecule's center of mass and the 
normal to the submanifold S (which the molecule has to cross to enter the region of interaction) ; 

SD4. Set the sample value of the equilibrium state of the molecule to be (q,hz(q)u). 
Theorem [5] is proved in Subsection 15.51 Proposition 1151 

2 Random dynamical systems 

In this section we derive a few general facts concerning random dynamical processes with the main 
goal of proving Theorem [2] A useful perspective informing this discussion is that our Markov 
chains arise from deterministic systems of which only partial information is accessible. The notation 
employed below is independent of that of the rest of the paper. 
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2.1 The Markov operator in general 



Let 7r : M -»■ X denote a measured fibration, by which we simply mean a measurable map between 
Borel spaces together with a family of probability measures 77 = {rj x : x e X} on fibers, so that 
Vxiir (x)) = 1 for each x. The family is measurable in the following sense: If / : M -»■ [0, oo] is a 
Borel function then x >-»■ T] x (f) is Borel, where r) x (f) indicates the integral of / with respect to rj x . 
We refer to 77 as the probability kernel of the fibration. 

A random system on X is specified by the data (it, T, rj, /x), where 7r is a measured fibration with 
probability kernel 77, is the initial probability distribution on X, and T ■ M -»■ M is a measurable 
map. We think of the map T as the generator of a deterministic dynamical system on the state 
space JVC. A point £ in M represents a fully specified state of the system, of which the "observer" can 
only have partial knowledge represented by 7r(£). (It is not assumed that T maps fibers to fibers.) 

From this we define a Markov chain with state space X as follows. Let \i be a probability measure 
on X representing the statistical state of the (observable part of the) system at a given moment. 
Then the state of the system at the next iteration is given by 



The notation should be understood as follows. From ji and 77 we define a probability measure ji o r\ 
on M so that for any, say L°° function / : M -*■ 1R 



The push-forward operation on measures is defined by T*i/(f) := v(f °T). The result is an operator 
P taking probability measures to probability measures, which we refer to as the Markov operator. 
When it is helpful to be more explicit we write, say, -P^t or Prj, instead of P. 

The probability kernel rj is the family of transition probabilities of the Markov chain. In keeping 
with standard notation, we let P act on measures (states) on the right, and on functions (observables) 
on the left. Thus Pf is the function such that (J.(Pf) = (fxP)(f) for all fi. It follows that 



We say that 77 is the disintegration of a probability measure conM relative to a probability /1 on 
X if v = fi o n. 

A probability measure v on M is invariant under T if T*v = z/, and a probability measure fx on 
X is stationary for the random system if /_tP = fi. 

Proposition 1. Let v be a T -invariant probability measure on the total space M of the random 
system (tt,T, 77, /i) and suppose that rj is the disintegration of v with respect to ji := ix^v. Then ix±v 
is a stationary probability measure on X . 

Proof. This is immediate from the definitions: 



Let n be a probability measure on X and define the Hilbcrt space L 2 (X,ji) with inner product 



ji >->■ jlP 



(it o T)*h o 77. 





(7r*7/)P = (n o T)t(7r*f) o 77 = tt^T^u = -k^v. 



We have used that 7r*(/i ° 77) = ji. 



□ 
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Proposition 2. Let {tt,T, 77, /x) be a random system, where T is an isomorphism (thus it has a 
measurable inverse) of the measure space M. and v := fi°r) is T -invariant. Let P v ,t be the associated 
Markov operator. Then P v ,t, regarded as an operator on L (X,fj,), has norm \\P TIi t\\ = 1 an d its 
adjoint is P~ T = P^.t- 1 ■ 

Proof. Jensen's inequality implies 

2 

\\Pv-Tff= f [ lf /(7r°T(0)d7fe(0 d»(x)< [ f \f(n°T(Of dr, x (OdKx)- 

JX J7r~ 1 (x) JX Jtt {x) 

The integral on the right equals f M \ f\ 2 o tt o T dv = f M \ f\ 2 o tt dv, by T-invariance of v. As / o tt is 
constant on fibers, this last integral is ||/|| 2 , showing that the norm of the operator is bounded by 
1. Taking / = 1 shows that the norm actually equals 1. To see that the adjoint equals the operator 
associated to the inverse map, simply observe the identity 

which is due to T-invariancc of v. □ 



2.2 Time reversibility and symmetry 

Let (7r,T, ry, \x) be a T-invariant random system, which means by definition that v = fiorjisa,T- 
invariant measure so, in particular, fj, is stationary. We say that the system is time reversible if there 
is a measurable isomorphism J : M -»■ M respecting tt and v, in the sense that it maps fibers to fibers 
and J*v = v, and satisfies 

ToJ= Jor 1 . 

Since J respects tt, it induces a measure preserving isomorphism J : X -* X (for the measure fi) 
such that Jott = tt o J. We denote also by J the induced composition operator on L 2 (X,fi), so that 
J f := foj. Notice that such J is a unitary operator on L 2 {X,n). We call J the time-reversing map 
of the system. 

Proposition 3. Let (tt,T,t], /jl) be a T-invariant random system with time-reversing map J, and J 
its associated unitary operator on L 2 {X,[i). Then P^t = J*Pii,tJ- 

Proof. A straightforward consequence of the definitions is that 

(P, hT Jf)(x)= f (/o^^oj^))^^), 

from which we obtain 

(Pn,TJf, Jg) = fjf ° tt) (T- 1 o J(0)g(TT o J(0) <M0- 

The last integral is now seen to be equal to f M /(7r(C))5(7r(7 1 (^))) dv{^) = (f,P Vy Tg) by using the 
invariance of v under J and T. □ 

We say that S ■ M -*■ M. is an automorphism, or a symmetry of the random system if it is a 
measurable isomorphism commuting with T that respects tt and v = fi ° rj. Thus S covers a measure 
preserving isomorphism of X, which we denote by S. 

Definition 1. The T-invariant, time reversible random system (jt,T,T), fi) with time reversing map 
J will be called symmetric if there exists an automorphism S whose induced map S on X coincides 
with the map J induced from J. 
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Proposition 4. Let (tt,T, rj,fi) be a symmetric (hence time-reversible and T -invariant) random 
system. Then the Markov operator Pr),T is self-adjoint. In particular, -P^t- 1 = Pr/,T- 

Proof. Given Proposition^ it is enough to verify that P v ,t commutes with the operator J. Keeping 
in mind that J = S, T o S = S oT, and that v is 5-invariant, we obtain 

f M f(J°*° T(0 ) g (tt(O) dv(0 = f M f(*°S° T(0)g (n(0 ) dv(0 

= f M f (tt o T(0) g (J' 1 o tt(0) <M0- 

This means that (Pr],TJf,g) = {Pri,Tf,Jg)- The claim follows as J is unitary. □ 

Corollary 1. Let (tt,T, n,[i) be a T -invariant random system, where T is an involution, i.e., T 2 
equals the identity map on M. Then the Markov operator P n .T is self-adjoint. 

Proof. Since T = T -1 , the identity map on / is both a time reversing map and a symmetry of the 
system. □ 

The situation indicated in Corollary [T] essentially describes a general stationary Markov chain 
satisfying detailed balance. In that case, X is the state space of the Markov chain, M = X x X 
(or, more generally, a measurable equivalence relation on X), and rj x is a probability measure on X 
for each x e X. For T we take the (groupoid) inverse map (x,y) >-> (y,x). As a special case, we 
suppose that X is countable and write in more standard notation p xy := n x ({y}) and ir x := 
Then the random system is symmetric as define above exactly when the Markov chain on X with 
transition probabilities (p xy ) and stationary distribution (n x ) satisfies the detailed balance condition 

T^xPxy = VyPyx for all X, y 6 X. 

2.3 Quotients 

A minor technical issue to be mentioned later calls for a consideration of quotient random systems. 
Suppose that G is a group of symmetries of the random system (7r,T, 77,/i) and that the action of 
G on M, as well as the induced action on X, are nice in the sense that the quotient measurable 
spaces are countably separated. (See, for example, [14] . where such actions are called smooth. In the 
situations of main concern to us, G is a finite group acting by homeomorphisms of a metric space, 
in which case the condition holds.) Without further mention let the actions of G be nice. 

We denote the quotient system by (7P, T, 77, Jl) , and the other associated notions, such as M and 
V, are similarly indicated with an over-bar. The quotient maps M -*■ M and X -*■ X will both be 
indicated by p. These maps and measures arc all defined in the most natural way. For example, T 
is the transformation on M such that T o p = p o T, which exists since G commutes with T,JI = p*fx, 
etc. Since G leaves [i invariant, it is represented by unitary transformations of L 2 (X,/i) commuting 
with P v ,t- So it makes sense to restrict P v ,t to the closed subspace L 2 G (X, /i) of G-invariant vectors, 
which is isomorphic to the Hilbert space L 2 (X,~p) under the map p* : L 2 (X,~p) -> L 2 j(X,/j,) that 
sends / to / o p. Thus we may identify the quotient Markov operator P- -f with the restriction of 
Pt),t to the G-invariant subspace. 

We use these remarks later in situations where the initial system does not have any symmetries 
that would allow us to apply Proposition 21 but it can nevertheless be regarded as the quotient of 
another system having the necessary symmetry. 
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3 A case study: the two-masses system 



In this section we prove Theorem [3] and further refinements, which are concerned with one simple 
but illustrative molecule-wall system. Other examples are described more briefly later in the next 
section. 

3.1 Description of the example 

Let Xi,X2 represent the positions of two point masses restricted to move on the half-line [0, oo), 
with respective masses mi and TO2- Mass mi, called the bound mass, is restricted to move between 
and I, and it reflects elastically upon hitting or I, moving freely between these two values. Mass 
TO2, called the free mass, moves freely on the interval [x\, oo), and collides elastically with mi when 
x 2 = x\. There are no forces or interactions of any kind other than collisions. We are interested in the 
following "scattering experiment" : Mass m,2 starts at some place along the half-line with coordinate 
x 2 > I and speed s > 0, moving towards the origin. It eventually collides with m 1; possibly several 
times, before reversing direction and leaving the "interval of interaction" [0,1]. When it finally 
reaches again a point with coordinate x 2 > I, we register its new speed S > (now moving away 
from the origin). It is assumed that we can measure s and S exactly, but that the state of mi 
at the moment the incoming mass crosses the boundary x 2 = I is only known up to a probability 
distribution. 




Figure 5: A two-dimensional billiard system corresponding to the one-dimensional scattering example 
described in the text. For the random system, start with a v e (0, oo), then choose a random (uniform) 
position along the horizontal dashed line and a random value for w according to a given, fixed, probability 
distribution on R. Then execute the billiard particle motion inside the triangle, with that initial position 
and initial velocity wei -ve2, where d are the standard basis of R 2 . Eventually, the billiard particle returns 
to the horizontal dashed line. When it does, let V be the absolute value of the vertical component of the 
particle's velocity. Then v >-*■ V is the random map for the free particle scattered velocity. 

More specifically, we assume that, at that moment in time, x\ is a random variable uniformly 
distributed over [0,/] and that the velocity v\ e K, of the bound mass has a known probability 
distribution that we do not yet specify. It is imagined that I is very small ( "microscopic" ) and that 
the bound mass is part of the wall, whose precise dynamical state is thus only known probabilistically. 
We wish to investigate the random process s i-> S, which gives the random speed of the free particle 
after one (macro-) collision with the wall (consisting of possibly many collisions with mi), given its 
speed before the collision. 

The state of the system at a moment when x 2 = I is fully described by the triple (x\, V\, s). It will 
be convenient to use coordinates x := \Jm~[fmx\, y = \J ra 2 \mx 2 , w = \Jrnijmvi and v = \fm 2 Jms, 
where m = mi +m 2 . A configuration of this two-particle system corresponds to a point (x,y) in the 
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billiard table region depicted in Figure [5J By doing this coordinate change we now have ordinary 
billiard motion, i.e., uniform rectilinear motion between collisions and specular collisions at the 
boundary segments. (The coordinates change turns the kinetic energy of the system into the norm, 
up to uniform scaling, associated to the standard inner product in 1R 2 .) 

States of the two-particle system are represented by tangent vectors on the billiard region. Ex- 
pressing the situation in the language of Section [21 let M be the set of states for which y = IWm^Jrn 

(see Figure [5]); omitting the y-coordinate, we write M := ^(x,w,v) : x e j^O, ^mi/mj ,v > oj. The 
observable states are represented by X = (0, oo) and we let it : M -> X be the projection on the 
third coordinate. The transformation T : M -» M is the billiard return map to the horizontal dashed 
line of Figure [SJ We choose the measure rj v on the fiber of v e X to be the product A ® £ of the 
normalized Lcbesgue measure on ^0, l^/mi/m^ and a fixed probability measure £ on R. 

For each choice of £, we obtain a Markov operator P of a random process with state space (0, oo). 
Such a process may be interpreted as a sequence of successive collisions of the free mass with the 
"microstructured wall." It may be imagined that the free particle actually moves in a finite interval 
of arbitrary length bounded by two such walls having the same probabilistic description, so that 
the process defined by P describes the evolution of the random velocity of mass mi as it collides 
alternately with the left and right walls. 



3.2 Stationary probability distributions 

For concreteness, let us choose the measure C to be the absolutely continuous probability measure 
on 1R with Gaussian density p wa ii(io) = (o\f 7 2/K')~ 1 exp (-iu> 2 /<7 2 ). This amounts to assuming that 
the state of the bound mass satisfies the Gibbs canonical distribution with temperature proportional 
to a 2 . One should keep in mind that by the above change of coordinates the kinetic energy of the 
bound mass is simply |w 2 , so it makes sense to refer to a 2 as the temperature of the wall. 

Proposition 5. Let P be the Markov operator with state space (0, oo) for the random process of the 
two-masses system. We assume that the state of the bound mass (or the wall system) has the Gibbs 
canonical distribution A <8> C with wall temperature a 2 . Then P has a unique stationary distribution 
fx, whose density relative to the Lebesgue measure on (0, oo) is the Maxwell-Boltzmann distribution 
with the same temperature: 

If n is any Borel probability measure on (0,oo), then r]P n converges in the weak-* topology to the 
stationary probability measure. 

Proof. We begin by showing that the Maxwell-Boltzmann distribution is indeed stationary for P 
without invoking the more general Theorem [TJ Let fj, denote the probability measure with density 
p froo . Then the probability measure ^ = /i o ?j on M is /z <8> £ ® A, which has density 

/ v 2 +w 2 \ 
p{x,v,w) = Cuexp I — I dxdvdw, 

\ 2(7 ) 

where C is a normalization constant. Let 9 be the angle in [-7r/2,7r/2] that the initial velocity 
of the billiard particle in Figure [5] makes with the normal vector to the horizontal dashed line 
pointing downward and r the Euclidean norm of the velocity vector. Then v can be written, in polar 
coordinates, as dv(x,9,r) = Cr 2 exp (-^2) cos 9 dxdOdr where C is now a different normalization 
constant. For each value of r, the measure cos 8 dxd9 is invariant under the billiard map restricted to 
a constant energy surface (see [5]; also compare with our more general Theorem U). In our case, it is 
invariant under the return billiard map T restricted to each coordinate r-slice. Therefore, v is itself 
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T-invariant. We can then apply Proposition [T] to conclude that [i is P-stationary. It will be shown 
shortly that P is an integral operator of the form (Pf)(v) = f Q k(v,u) f(u) du, where k(v,u) > 
for each v and all u. In particular, it is indecomposable and non-periodic, and for each v, S V P is 
absolutely continuous with respect to the Lebesgue measure on the half- line. By, say, Theorem 7.18 
of pQ, the stationary measure is unique and the claimed convergence holds. □ 




Figure 6: Evolution of an initial probability measure, fio, of the free mass velocity, having a step function 
density. The graphs in dashed line are the initial and the limit density t>exp(-i; 2 /2), and the other graphs, 
from right to left, are the densities of po-P™ at steps n = 1, 10, 50, 100, of the Markov chain of Proposition \S\ 
We have used a finite rank approximation of P obtained by numerically simulating the mechanical system 
with mass ratio mi/m2 = 100. 



Reverting to the non-scaled variables and introducing the parameter j3 such that /ST 1 = m\o\, 
where <j\ is the variance of the velocity of the bound mass mi, then the stationary distribution for 
the speed of the free mass has density 

Pmb(w2) = l3m 2 v 2 cxp {^fi 7712 ^ 2 j ■ 

We interpret j3 as the reciprocal of the wall temperature. 

3.3 The random map 

Proposition^ gives the equilibrium (stationary) state of the free mass velocity process. This equilib- 
rium state is arrived at by iterating a random map on (0, oo) with transition probabilities operator 
P. We wish now to describe this random map more explicitly. In the following analysis, we assume 
that 7Ti i > 3m2. First we set some notation: Let 7 := \fm2Jrn1 = tana, where a is the angle of the 
billiard table triangle indicated on Figure [5] Define 

I-7 2 2 7 1 - 6 7 2 + 7 4 - _ 4 7 (1 -7 2 ) 

a: "T7^' i + 7 2 ' a: " (1 + 7 2 ) 2 ' (i + 7 2 ) 2 ' 

Also define the functions 

7 M , . 2(1 - 7 2 ) 4 7 H 

Pv(w) :=—== — , q v (w) := — — —-— — — , 

^J\ + ^2 v 1+7^ 1+7^ v 

and introduce the partition of (0,oo) into intervals 1^ = \w\P, i = 1,2,3,4, where 

I\ ■= (0, tana], I2 '■= (tana, tan(2a)], -Z3 := (tan(2a), tan(3a)], I4 := (tan(3a), 00). 
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It is useful to note that tan(2a) = 27/(1 - 7 2 ) and tan(3a) = 7(3 - 7 2 )/(l - 37 2 ). To simplify the 
description of the map, we make the assumption that mi > 3m2, which is equivalent to a < 7r/6. The 
random map can now be expressed as follows: Choose w € 1R at random with probability £ (say, the 
Gaussian probability with temperature a 2 ) and define the affine maps 

F?(v):=av + bw, F?(v):=av-bw, F^(v) := -av + bw. 

These are the deterministic branches of the random map (see Figure [7]). 




Finally, let F w : (0, 00) -*■ (0, 00) be the piecewise affine random map defined on each interval 
of the partition as follows. Case I: If w > 0, then F w (v) = F^{v). Case II: If w < 0, then 



F w \ n (v):=F^( 



F% 2 (v) := 



F% 3 (v) := 



F^"'(v) w. prob. p v (w) 
F^(v) w. prob. q v (w) 
y F^(v) w. prob. 1 - q v (w) -p v (w) 



\Fi(v) w. prob. p v (w) 
\F^(v) w. prob. l-p v (w) 



(«):= 



I -^l ( u ) w - prob. p v (w) 
2 



F^"\v) w. prob. l-p t ,(u;) 



('w. prob.' = 'with probability.') These expressions are obtained using the standard idea of "un- 
folding" the polygonal billiard table and some tedious but straightforward work. 



3.4 A remark about symmetry 

Before continuing with the analysis of the example, let us briefly examine a small modification of it 
to illustrate a general point concerning symmetries. 

The modified example is shown in Figure |S] By Proposition 0] its Markov operator is self- 
adjoint on L 2 (R, /4 ym ), where /i sym is the even measure on the real line whose conditional probability 
distribution, conditional on the event that the free mass approaches from the right-hand side of the 
wall, equals the stationary measure asserted in Proposition [5] Denoting this operator by P syia , then 
f S ym = (P + -P*)/2, where P is the Markov operator of the first version of the example. 

Although this remark illustrates a useful general point made in Subsection l2.3[ it turns out that 
P is already self-adjoint in the present case. In fact, the map J on X such that it o J = J on is the 
identity so the symmetry S can be taken to be the identity map itself and Proposition [4] applies to 
the original system. 
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Figure 8: A symmetric version of the example of Figure [S] (top of figure). The free particle may approach 
the wall from the left or from the right, with equal probabilities. The states of the bound masses on the 
right and left are equally distributed and independent. The map J is the time reversing map, and S is a 
symmetry compatible with J, in the sense that Proposition U applies. 



3.5 The integral kernel and compactness 

We now wish to show that the operator P on L 2 ((0, oo),p) is Hilbert-Schmidt. This is the content 
of the below Proposition [5] 

It is not difficult to show that P is an integral operator, (Pf)(v) =■ / °° n(v,u)f(u) du, whose 
integral kernel k has the following description. Write p a (w) ■= p va .n(w) to emphasize the parameter 
<7 of the Gaussian distribution. Then, 

K{v, u) = p ba (u - av) 2^ Q t \v, — - — j l Ji(v) (u) + p lry (u + av) ^ Q, \v, — = — j lj. (u) 

where 1a is the indicator function of a set A, the Qi arc 

Qi(v,w) = Q 2 {v 1 w) = 1, Q 3 (v,w) =p v (w), Q4(v,w) = q v (w), Q 5 (v,w) = 1 -p v (w), 
the Qi are 

Q^v^w) = l-p v (w), Q 2 (v 1 w) = l-p v (w) -q v (w), 
the intervals Ji and Ji are 

Ji = (av,oo), J 2 = (cw,oo), J 3 = {av,cv),J A = (0,v/c),J 5 = (v/c,av),Ji = (v,cv),J 2 = (cv,v) 

and, finally, 

7 2 (3 + 3 7 2 +7 4 ) 3-7 2 

C=~ 7TT-, TTT, C~ 



(l+7 2 )(3- 7 2 )' I + 7 2 ' 

Let p be the measure on (0, oo) having density p !lac (v) = o~ 2 v exp (-u 2 /2er 2 ) and K(v,u) the 
integral kernel of P relative to p. Thus (Pf)(v) = / °° K(v,u)f(u) dp(u). Then there exists a 
constant C such that 

C 1 ju-av\ 2 u 2 

(3.1) K {v,u) ■■= Pba{u-av)/p beB (u) = — expj-— I — - — I + — 

Define Ko(v,u) similarly, by substituting a and b for a and b. 
Proposition 6. Let K(v,u) be one of the following kernels: 

— j u - av\ 

J\o(v,u)l( cu ,oo)(w), A'o(w,m)1(™,<x»)(u), or K (v,u)q v l — - — I l( 0jt , /-)(") 

where c is a positive constant. Then K has finite Hilbert-Schmidt norm with respect to the measure 
p on (0, oo). It follows that P is a Hilbert-Schmidt self-adjoint operator. 
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Proof. This amounts to showing that f °° K(v,u) 2 d/i(v) dfi(u) < oo. Expressing the integrand 
in terms of the Lebesgue measure dv du, omitting multiplicative constants, and setting a = 1 for 
simplicity, we have to show, for the first of the three kernels, that 



r 00 r°° I (u-av\ 2 2 u 2 +v 2 \ i> . . , , 

I:= Jo Jo eXP H^J +U r~ \-hc V ,~)(u)dvdu< 

Making the substitution s = v/u, we obtain 



sdsdu, 



which clearly is finite. The second kernel is treated in the same manner. To deal with the third 
kernel, first observe that 

/(-) : = ) 2 + ^ * e := \{l - « 4 )/(l + a 2 ) 2 > 

for all s. We have used a 2 + b 2 = 1 and a = (1 - 7 2 )/(l + J 2 ) < 1. The same change of variables, 
s = v/u, now yields 



I-2fue, P (-eu 2 ) f exp ((i^ + ^ - e) 



where the identity q v ((u - av)/b) = 2/s for u < u/c was used. The value of the integral in s decreases 
in u as 0(l/u), so 7 converges. The actual kernel of P is a union of kernels of the three types 
considered, so it also has finite Hilbert-Schmidt norm. It is interesting to note that the norm 
worsens as 7 approaches 0. □ 

It is clear from the description of the integral kernel of P (and by using the general results and 
definitions from, say that Markov chains associated to P are Lebesgue- irreducible, strongly 
aperiodic, recurrent, and admit a unique stationary probability. 



3.6 A perturbation approach to the spectrum of P 

For convenience, we make the velocity variables dimensionless by dividing them by er, which is the 
standard deviation of the wall-mass velocity. In this way, the stationary probability distribution for 
the free mass becomes p(z) = zexp(-z 2 /2), where z = v/a. The specific form of the (dimensionless) 
velocity distribution of the wall mass will be unimportant, but we assume that it has mean zero 
and variance 1. Having fixed the temperature, the main parameter of interest is 7 = \J 1112/1121. Wc 
denote by P 7 the Markov operator acting on L 2 ((0, 00), where d/i(z) = p(z)dz. Then the main 
remark of this subsection is the following proposition. 

Proposition 7. If <fi is a function of class C 3 on (0, 00) vanishing at and oo, then 

(P^)(,)-^) 
7^0 27 2 v n ' 

holds for all z > where L , the billiard Laplacian of the system of Example [3 is defined by 

(Lcf>)(z):=(±-z ) )<p'(z) + <p"(z). 

Equivalently, L can be written in Sturm- Liouville form as Lip = p 1 j^{p^)- This is a densely 
defined, self-adjoint operator on L 2 ((0,oo),n). 
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Figure 9: Comparison of the second eigendensity of P, obtained by numerical approximation, and the second 
eigendensity of the billiard Laplacian £, which is (1 - z 2 /2)p(z), both normalized so as to have maximum 
value 1. The graph of the latter (dashed line) has been intentionally offset upwards by a small amount to 
better distinguish the two. We have used 7 = 0.1; the numerical value for the second eigenvalue of P (after 
A = 1) was found to be 0.9606, to be compared with I + 27 (-2) = 0.9600, which uses the eigenvalue -2 of L. 



Proof. For the sake of brevity, we give the basic idea of the proof in the special when the velocity 
distribution of the wall-bound mass is Bernoulli, taking values ±1 with equal probabilities 1/2. We 
make the additional simplification of ignoring the branch F^' of the random map (see Figure [7]) . 
Notice that the intervals 1^,1^,1^ (same figure) lie in [0,37 + 0(j 2 )]> so om Y ^1" anci arc 
expected to be important. Thus we consider the simpler random system define as follows. Let 
Fi(z) ■■= az + b and -^2(2) -az-b, where we approximate a = 1 - 27 2 and b = 27. Let p := 7/z. Then 
the approximate random dynamics corresponds to applying F± with probability (1 +p)/2 and F2 
with probability (1 -p)/2. Define the fc-th moment of scattering as 

& k (z) ■= E Z [(Z - z) k ] = (F 1 (z) - zf + (F 2 (z) - zf , 

where Z is the random speed after collision, z is the speed before collision, and E z [-] indicates 
expectation given z. From this general expression we derive 

£ 1 (z) = 2 7 2 (i-z), £ 2 (z) = 4 7 2 + 0( 7 4 ), Z k (z) = 0( 1 k ). 

Now, we expand (P 7 y> _ y)(#)/27 2 = E z [(ip(Z) - ip(z)) /2"/ 2 ] in Taylor polynomial approximation 
up to degree 2 and obtain 



(P^)(z) = X_ ^, {z)Ez [z _ z] + y (z)Ez [{z _ z)2] + Ez [Q {{z _ 2|3)] j = {Zv){z) + 0(7)j 

proving the main claim in this very simplified case. The general case, although much longer and 
tedious to check, can still be obtained in a similar straightforward manner. □ 

An eigenmeasure of P is defined as a signed measure v such that vP = \v. The density of an 
eigenmeasure relative to Lebesgue measure on the half-line will be referred to as an eigendensity of 
P. They have the form <pp, where p is the stationary probability density and 4> is an eigenfunction, 
P(p = \<fr- Based on the proposition, the first few eigenvalues and eigendensities of P for sufficiently 
small values of 7 are expected to be approximated by those of the operator I + 2^ 2 L. We do not 
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study this spectral approximation problem here (this is part of a more general study that will be 
presented elsewhere), but only point out the numerical agreement for the second eigenvalue and 
eigendensity shown in Figure^ 

The spectral theory for L corresponds to a standard Sturm-Liouville problem. In fact, under 
the change of coordinates x = z 2 /2, £ is (up to a constant multiple -2) the differential operator of 
Laguerre's equation 

2(xe~ x <t> x ) x = \e~ x 4>. 

Polynomial solutions exist for A = -2n, where n is a non-negative integer, and the corresponding 
cigenfunctions are easily obtained by textbook methods. The second polynomial cigenfunction (the 
first being the constant function) is given by 4>(z) = 1 - z 2 /2, associated to eigenvalue A = -2. Thus 
it is natural to expect that the spectral gap of P is approximately Aj 2 for small values of 7. 
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Figure 10: Asymptotics of the spectral gap of P for small values of the mass-ratio parameter 7. The discrete 
points are the values of the gap obtained numerically. The solid curve is the graph of f(-y) = 47 2 , suggested 
by approximating P by a second order differential equation. 



4 Other examples 

We give a few further examples of simple systems to illustrate the content of the main theorems. 
4.1 Wall systems without moving parts 

In this subsection we very briefly consider examples having a trivial wall system, for which the Gibbs 
canonical distribution does not make sense. These are nevertheless interesting, and we have studied 
them in some detail in previous papers. (See [BJ|S1|5].) Our only concern here is to see how they fit 
into the present more general set-up. 

By assumption, M wall reduces to a single point. If we further assume that the molecule is a point 
particle, then the only dynamical variable of interest is the velocity before and after the collision 
event, v, V e H, where H is a half-space in R fe+1 . By conservation of energy, ||u|| = || so it suffices 
to take the hemisphere S + of unit vectors in H as the state space for the Markov chains. The 
only random variable is the point in T k , assumed to be uniformly distributed. Given an essentially 
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bounded function / on 5 + , the Markov operator applied to / takes the form 



(4.1) 



where ^ v (x) is the post-collision velocity with initial conditions (x, v) . A first example was suggested 
by Figure[2] Planar billiards, as in Figure fTTl provide a large and interesting general class of examples 
of a purely geometric nature, for which the operator P is canonically determined by the billiard shape. 

In the next proposition, let J : S + -*■ S + be the linear involution that sends the north pole to 
itself and points on the equator to their antipodes. We use the same symbol to denote the induced 
composition operator on functions on S + . A unit of the periodic contour which, for the example of 
Figure [TTJ is shown on the right-hand side of the figure, will be called a billiard cell. A billiard cell 
is symmetric if it is invariant under u — u in T k (which induces the map J on velocities) 

Proposition 8. For the systems without internal moving parts as described by the operator P in 
\4-l\ the probability measure fi on S + defined by 



is stationary, where dA is the Euclidean k- dimensional volume form on S + , n is the unit vector 
perpendicular to the boundary of H, pointing towards the billiard surface, and the inner product is 
the standard dot product. On the Hilbert space L 2 (S + ,[i), the operator P is bounded of norm 1 and 
satisfies P* = JPJ. If the billiard cell is symmetric, then P is self-adjoint. 

Proof. This follows from the general description of invariant measures of Theorem [T] and the fact 
that the conditions of Proposition [JJ are satisfied. See also jS]. □ 



Figure 11: Billiard collisions between a point particle and wall with periodic contour. A point particle 
reflects off of a hard surface with periodic relief in ordinary billiard fashion. Here M mo i and M wa n consist of 
a single point each, and M is the subset of R x T 1 indicated on the right-hand side of the figure. It is given 
the induced flat metric and the potential function is constant. 

The operator P is often (and, for the specific contour of Figure [TTJ this is a consequence of results 
in [B] or [5]) a Hilbert-Schmidt operator. A problem of particular interest is the relationship between 
its spectrum of eigenvalues and the geometric features of the billiard cell. We refer to [5] and [5] for 
more information. 

Another example to which Proposition [3] applies is shown in Figure [T^J In this case, the wall is 
featureless but the molecule is not: M wall consists of a single point and M mol can be identified with 
the circle S 1 . We assume constant potentials. Since the wall is translation invariant, the length 
scale for T 1 is not specified (and not needed). Let I be the fixed length of the arm connecting the 



dfi(v) : 



r(fc/2 + l) 

^72 



(v,n) dA(v), 



R 
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two masses and m = m\ + mi the total mass. Let (x, y) represent the coordinates of the center of 
mass of the dumbbell molecule. Then the configuration manifold is given by 

M = \([6],x,y) : minly - — lsin6, y + — I sine] > 

Here [9] is element in S 1 = R/27rZ. A (9, y)-cross section of M is shown in Figure[13] By introducing 
the scaled angle coordinate 

z := ^^ 19 
m 

the kinetic energy, as a function of the coordinates (x, y, z, x, y, z) on the tangent bundle of M, takes 
the form 

E(x,y,z,x,y,z) = -m (x 2 + y 2 + z 2 ) , 

which corresponds to the standard Euclidean metric in regions of 1R 3 . In terms of these new co- 
ordinates, collisions are described by ordinary specular reflection on the boundary of M. (We are 
assuming here that the surface is perfectly smooth in the physical sense, i.e., there is no tangential 
transfer of momentum between the particles and the surface.) 
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Figure 12: Collision of a rotating dumbbell with a flat surface. 



By restricting attention to a cross section (x = constant) this 3-dimensional system can be reduced 
to a 2-dimensional system that is very much like the one of Figure [TTJ with z taking the role of the 
length coordinate on T 1 in the first example. If we assume that z is random, uniformly distributed, 
then we have a system that is of essentially of the same kind as that of the previous example. 



y 




Figure 13: A cross section of the manifold with boundary depicted in Figure [T2l 



Letting the normalized speed u = v/v max of the center of mass of the dumbbell molecule be the 
variable of interest (assuming, for simplicity, that the constant horizontal momentum is zero), then 
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P is regarded as a Markov operator with state space [0,1], where v w = ^jW/ m is the maximal 
speed that can be attained for a given, fixed, energy value. Writing u = sin 9 for 9 e [0,7r], the 
stationary distribution /i given by Proposition [5] has the form 

dn(6) = ^sm9d9. 

It is easily shown that, for any initial probability distribution for 9, the corresponding Markov chain 
is (Lebesgue measure)-irreducible and aperiodic, and fx is the unique stationary probability measure. 

Notice that P does not depend on the length I separating the two masses since changing I only 
produces a homothety change of the rescaled region of Figure Q2] (i.e., after the change of variables 
from 6 to z). So P only depends on the mass-ratio. Let 7 := v/mi/ma- It is convenient to set 
I = (7"* +7)/27r. With this choice, the billiard cell contour in the y,z coordinate plane is bounded 
below by the graph of the function 

y = — max {-7 sin(27rz) , j^ 1 sin(27rz) } 

for z € [0,1]. 

Based on the results and arguments from [6] and [8] one should expect P to be an integral 
operator (Hilbert-Schmidt, or at least quasi-compact). We do not attempt to show this here, but 
only offer the numerical observation about the dependence of the spectral gap of P on the mass-ratio 
parameter 7 shown in Figure [Til The graph exhibits a great deal of structure which, at this moment, 
we do not know how to interpret. 
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Figure 14: The spectral gap of P as function of the parameter 7 = where mi > my. 



4.2 Adding potentials 

For an example with a non-constant potential function, consider the system on the left-hand side of 
Figure 1151 This is similar to the two-masses system of Section [3] except that we add a linear spring 
potential acting on mass mi. Thus we consider a spring- mass with (essentially point) mass mi, 
which comprises the wall subsystem, and a point mass mi corresponding to the molecule subsystem. 
Suppose for simplicity that the free mass mi can only move vertically, so the whole set-up should 
be regarded as one-dimensional. It is assumed that there are no potentials involved other than the 
elastic potential of the spring. (In particular, no gravity.) Then M wall is an interval, M mol is a single 
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point, the torus component has dimension 0, and M is the subset of 1R x M wall indicated on the 
right-hand side of the figure. 

The region of interaction is the interval [0,1], where I is a positive number. Let x\, x 2 indicate 
the positions of the masses mi and m 2 in physical space (on the left-hand side of the figure). Using 
the scaled coordinates 

I mi 
y = \ — x 2 



V m \ 2/' 



for the respective positions of ui\ and 7712, the energy function for the system with linear spring 
potential is given by 



E{x,y,x,y) 



(x 2 + y 2 + x 2 \ 

\ mi J 



The motion in M between two collisions (of the two masses or of mi with the bottom wall or the 
semi-permeable wall at I) is given by the functions of t: 



(4.2) 



x(t) = xq cos I yl — t j + x o\J —j^- sin j 




m 1 



* j y(t) = yot + yo- 



m 2 




Figure 15: Collision of a point mass with a spring-mass system. 



The state variable of the Markov chain in this case is taken to be the speed v = \y\ 6 (0, 00) of 
TO2- It is assumed that the statistical state of mi is a Gibbs distribution with parameter /3. For 
concreteness, we describe chain transitions v old >->■ i> ncw in algorithmic fashion: 

1. Choose independent, uniform random numbers U\, U2 in [0,1], and a sign s e {-,+} with 
equal probabilities; 

2. Let £ = --s\nUi\ thus £ € (0, 00) has probability density /3exp(-/3£); 



3. Let £,(£) := minH, 5 (j\f /2 \ and x = (^ff 1 ) 1 ^ sin {{2U 2 - 1) arcsin(i(£))) ; thus x has prob- 



'2£mi \!/ 2 „• 



ability density proportional to over the interval -L(£)\/2£mi/mfc < x < L(£)\/2£mi/mk, 
where h^(x) := ^2(£ - U(x)) and U(x) = kmx 2 /mi is the spring potential in the scaled 
coordinate; 



2G 



4. Set (x, shg(x)) to be the state of the mass mi when 7772 enters the region of interaction, 
and (l\J 7712/771, w id) the state of 7712. Let the system evolve, deterministically with this initial 
condition until m 2 is back at position l\fm^Jm. Along the way, assume that collisions with 
the boundary of the two dimensional region on the right side of Figure [T5] are specu lar and in 
between collisions the trajectory satisfies equations 14.21 When 777,2 reemerges at l^/rn^Jm set 
!)„„ equal to its new speed. 

By Theorems [T] and 0] this Markov process has stationary probability given by the Maxwell- 
Boltzmann distribution (after reverting to the variables prior to scaling, with speed u = \Jm/rri2v) 




1 2 

speed 



Figure 16: The figure superposes the graph of the stationary distribution 14.31 and the same distribution 
obtained by numerically simulating the Markov chain according to the algorithm described above. 

In comparing theses distributions with the corresponding textbook expressions, the reader should 
keep in mind the distinction between the Maxwell-Boltzmann distribution in the interior of billiard 
table (the gas container) and the similar distribution on the boundary surface (wall). The latter has 
density proportional to 

,3777.7; • nHI^ 1 exp |-/3^y^ j 
in dimension d, where 77 is the unit normal to the wall surface. 

5 Invariant volume forms 

The main purpose of this section is to prove Theorems [T] and |4l 
5.1 Definitions 

Recall the function d : M R introduced in Subsection HH The set S := M mol x {0} x T k x M wall 
is the level set d = 0. It will be convenient in this section to disregard the part of M given by d > 
and consider S as a submanifold of the boundary of M. Observe that S lies in the interior of the 
product region, where the molecule and wall subsystems are non-interacting. Thus it makes sense 
to define over a neighborhood of S in M the unit vector field v along the R-factor of M mol . Wc 
choose the direction of v so that it points towards the region of interaction. The restriction of v to 
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S is then a unit vector field perpendicular to TS, pointing into M. Let N := TM and N$ ■= l*TM, 
the pull-back of TM to S under the inclusion map t : S -* M. Also define the subset Ng of N$ 
consisting of v such that (v,v) > 0, Ng := -iVJ, and Ng := T5. These are all bundles over S 1 . We 
often denote fibers of a bundle using subscripts, as in N q = T q M. When this is inconvenient, we use 
function form, so that Ns(q), for example, is the fiber of N$ above q e S. Projection maps for these 
bundles will be denoted by the same symbol r. Projection maps for other fibrations will typically 
be denoted by it. 

Clearly, the reflection map R maps Ng to Ng and Ng to itself. More generally, we can define Ng 
as the restriction (pull-back) of N to dM, and Ng as we did in the case of JV5. The notation N 9± 
may also be used when convenient. On regular points of the boundary, the reflection map is defined 
on Ng. If we wish to emphasize that something is taking place over regular points of the boundary 
(or regular points of the function E), we may indicate this by adding a subscript such as in 

Let (■,•) be a Riemannian metric on M and U "• M -> M the potential function. Unless explicitly 
stated otherwise, functions and tensor fields are assumed to be smooth on interior and regular 
boundary points. The Riemannian metric defines the kinetic energy function k ■ TM ->• 1R given by 

!|i 1,2 1 
2 HI « := 2< 

The (total) energy function of the Newtonian system on M with potential U is E : TM -*■ 1R such 
that 

E(q,v) ■= K(q,v) + U(q). 

We write N(E) := {(q,v) e N : E(q,v) = £}. Clearly, the base point projection r : N -+ M maps 
N(S.) \ zero section into M := {q e M : U(q) < £}. The intersection of N(E) with N$ is denoted 
Ns(E-). Similar notations arc used for the various related sets defined earlier. Whenever convenient, 
we specify points in N simply by v instead of (q, v). For example, we typically write q = t(v), T v N, 
dr v , etc., for v e N. 

Let V be the Lcvi-Civita covariant derivative operator. If X(s) is a vector field along a differ- 
entiable curve s c(s) such that v = c'(0), the covariant derivative of X along c at s = will be 
written ^j-| s _ or, when appropriate, V V X. The horizontal bundle H over N is the subbundle of 
TN defined as the kernel of the connection map K v : T V N -> T T ^M, which is derived from V as 
follows. Let v'(0) e T V N, where s >->■ v(s) is a differentiable curve in N such that v(0) = v; then 



K (<7>«) : = TilH'q '■= ^(V,v)q. 



K v v'(0) - I 



v{t). 



Write ^ := v'(0) and w ■= dir v £. Let Y be a smooth section of it over a neighborhood of q = ir(v) 
such that Y(q) = v and dY q w = Then K v £ = V W Y. 

Let V denote the vertical bundle, which is the vector bundle over N whose fiber above a v e N 
is the tangent space to AO,,) at v. Thus V v is the kernel of dn v , the projection dir v ■ H v -*■ T q M is a 
linear isomorphism, and the direct sum decomposition TN = H ffi V holds. If A is a smooth vector 
field on M, then the horizontal lift of X is the smooth section of H given by 

v^X H (v) := (dT^HS'Xiriv)). 

For each v e N with q = t{v), define the linear isomorphism l v : T q M -> V v , called the vertical lift 
map, by 

d 



l v(w) '■= -r 
as 



(v + sw) 



The alternative notation w (v) or w v will also be used later instead of l v (w). If A is a smooth 
vector field on M, then the vertical lift of A is the smooth section of V given by 

v~X v (v) ■= (X(t(v)))V. 
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5.2 Contact, symplectic, and volume forms 



The manifold N is equipped with the canonical contact form 9 defined by 9 V (£,) '■= {v,dT v (;} q , for 
v e N(q). It is well-known that d9 is non-degenerate, hence a symplectic form on N . In terms of 
the Ricmannian metric, 

(5.1) 6^(^,6) = (KM,dr v &) ~ (K v ^ 2 ,dT v ^) 

from which it easily follows that d6 is indeed non-degenerate and that V and H are Lagrangian 
subbundles. 

The Hamiltonian vector field (associated to the energy function E) is the vector field X E on N 
such that 

(5.2) X E JdO = -dE. 

One easily shows that E and d9 are invariant under X E . Thus X E E = and £>x E dO = 0, where Lx 
indicates the Lie derivative along X. The contact form 6, however, is not in general invariant but 
satisfies £>xQ = dL, where L is the Lagrangian function L(q,v) ■= n(q,v) - U(q). The Hamiltonian 
vector field can be written as 

(5.3) X E = Z -(gmdU) v 

where the geodesic spray Z is the vector field on N defined, at each v, as the horizontal lift of v to 
H v . In particular, dn v Z(v) = v and dir v X E = v. Observe that (q,v) is a critical point of E exactly 
when X E (v) = 0, which can only happen when v = and q is critical for U. 

The Hamiltonian flow is the (local) flow of X E , which we denote by t h> <I> t := . The flow lines 
project under 7r to curves c(t) on M that satisfy Newton's equation, and any solution of Newton's 
equation lifts to a flow line in N. It will be convenient to let 1 1-» <&t(u) represent a trajectory of the 
system through its entire history, which may include collisions and reflections with the boundary of 
M . The Hamiltonian vector field is essentially complete, in the sense defined earlier in Section 11.21 
(part (iv) of list of assumptions). 

Proposition 9. Let v 6 -/V(£) be a regular point for E and let q = t(v) be either an interior point 
of M or in the regular boundary d r M . Then T V N(£) consists of ^ € T V N such that 

(5.4) d(Uo T ) v t + {v,K v t) q = 0. 

The projection dr v : T V N(E) T q M is surjective. In fact, for each w e T q M, 

w A (v) := w H (v) - \\v\\- 2 dU q (w)W(v) e T V N(E) 

satisfies dr v w A (v) = w. The space T v N d+ (£) consists of all £ e T V N such that \5.4\ holds and dr v £ is 
tangent to dM . 

Proof. These claims are easily derived by observing that Equation l5.ll the definition of X E in l5.2[ 
and Equation 15.31 imply 

dE v (0 = d9 v (^X E ) = (K v ^dr v X E ) - (K v X E ,dT v £) = (v,K v O + (grad U,dr v t). 

Of course, £ lies in T V N(E) iff dE v (£) = on regular points of E. □ 

Proposition 10. Let ' (£) be the subset of N d (H) of v such that t(v) is a regular boundary point 
of M and i ■ iV r ^ 9 (£) -> N the inclusion map. Then L*d9 is a symplectic form on N® (£) \ T(d reg M). 
Furthermore, the reflection map R : iVl(£) N® ' (£) leaves 1*6 and L*dO invariant. 
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Proof. Here and in what follows, we assume that all points under consideration are regular for E 
and, if on the boundary, that they are regular boundary points. If v + 0, then v is necessarily regular 
for E. With this in mind, we omit references to 'regular' in the notation from now on. 

The main issue is to check that w := u*d9 is non-degenerate. Let £ € T„TV 9 (£) and assume that 
^vi^v) = f° r au V e T V N 8 (£), where v is not tangent to the boundary of M. First choose 77 e V v 
such that K v rj is orthogonal to v. Then = w„(£,ry) = —{K v rj,dT v £}, so dr v t; is orthogonal to all w 
such that w 1 v. That is, dr v £, is a scalar multiple of v. By Proposition [9] this vector is tangent to 
the boundary, thus zero by assumption, and £ must be a vertical vector. Let now r\ be an arbitrary 
tangent vector to TV S (£) at v. Then as = £*;„(£, 77) = (K v t;,dT v r]) and dr v r] is tangent to dM we 
conclude that K v £ is a scalar multiple of the normal vector n at t{v), that is, £ = Xn v (v) for some 
A € R. From Proposition |H] it follows that = d(U o r)„£ + {v,K v £) n / v \ = \(v,n) T t v \, which implies 
that A = 0. Therefore, £ = 0. □ 

It is useful to introduce the Sasaki metric on TV, which is the Ricmannian metric defined by 

{€,v)v '■= (dT v £,dT v r)) q + {K v £„K v m) q 

for all £,77 € T„TV and q = t(v). In terms of this metric the vertical and horizontal subbundles are 
mutually orthogonal and H v , V v are isometric to T q M under dr v and K v , respectively. 
Define the vector field 

r] := (grad 75)/||grad E\\ 2 , 

the gradient and norm being associated to the Sasaki metric. Observe that dE(if) = 1, so that 
£o$J = £ + s, where $^ denotes the local flow of r/. It is not difficulty to obtain the expressions 

grad E =W + (grad \J) H , ||grad E\% = |t>[* + ||grad U\\ 

where W is the canonical vertical vector field, defined by W v ■= for each v e TV. 

Let n := (d9) m , where m = dimM. Then 57 is a volume form (i.e., a non- vanishing form of top 
degree) on TV. It is also invariant under the Hamiltonian flow since LxdO = 0. 

Proposition 11. Let X E be the Hamiltonian vector field for the energy function E , andr\ the vector 
field introduced in the previous paragraph. Define Ci E := rj J i7. Then 

1. n = dEAfl E ; 

2. X E J n E = midO)" 1 - 1 + m(m - l)(r; a tiff) AdEA ; 

3. L x eVL e = dEA(n -iL XE n E ). 

It follows that the restriction of £l E to each level set TV(£) is non-vanishing (a volume form) and 
invariant under the Hamiltonian flow, and that the restriction of X E J il E to the same level sets 
equals m{d6) m ~ 1 . 

Proof. For part 1, write dE a {l E = fQ and take the interior multiplication on both side with rj to 
conclude that fCl E = Cl E , As £l E does not vanish, we have / = 1. For part 2, observe that 

X E j (n j Q) = X E j (m (rj _i dff) a (dO)™- 1 ) = m d6 (77, X E ) (dO)™- 1 - m(n _i dff) a (X e j (dOy 1 - 1 ) . 

But d9(n,X E ) = dE(n) = 1 by the definition of X E and rj. Also X E j^)" 1 " 1 = -(m-l)d^A(d6») m - 2 . 
For part 3, obtain from £ X E(dE a f2) = that dE a L x eH e = and take the interior multiplication 
on both sides of this equation with 77. □ 
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5.3 Billiard maps 

We fix an energy value £ and assume that A(£) has finite volume relative to Q E . Here we will let 
S denote more generally than before (the regular part of) a sub-manifold of the boundary of M . For 
any given v e iVg(£), define 

7(v) := inf{i > : $ t (v) e (£,)}, 

which is oo if the flow line never returns to iV<t(£). By Poincare's recurrence applied to the Hamilto- 
nian flow, T is finite with probability 1 with respect to the flow-invariant probability measure derived 
from Q E . Now define the return map iVg(£) by T := R o $, where := <&g-(u)(v), R being the 
reflection map. Then T is almost everywhere defined, and by one of our standing assumptions it is 
almost surely smooth. (Section [121 assumption (v); see [3] for how this point concerning smoothness 
is argued in the simpler case of plane billiards.) We denote by fl E ' the pull-back of X E J fi to 
Ng(£.) under the inclusion map. By PropositionlTTlthis form agrees with the pull-back of m(d9) m ~ 1 . 

Proposition 12. The return map T ■ Ag(£) Ng(£.) preserves fi ' almost everywhere. 

Proof. This involves a standard argument, which we briefly recall. Let v e iV^(£) admit a neighbor- 
hood U where T is smooth. Let c : [0,T(u)] -*■ N(E) be the orbit segment connecting v to ^^^(v), 
and 71 a closed curve contained in It. Let D be a smooth embedded disc contained in U that is 
bounded by 71, and denote by 72 the image of 71 under $. Then 71 sweeps out a surface E under 
the Hamiltonian flow such that the boundary of E is the union of 71 and -72, where the negative 
sign indicates orientation. Notice that the restriction of d9 to E is as E is constant on this surface 
and the interior multiplication of dO by the Hamiltonian vector field is -dE. So 

0= f d0= [ 9- [ 9 = [ [9-3>*9] = f d[6-<f>*6]. 

As 71 and D can be made arbitrarily small, we conclude that <fr*d9 = dO. Since R also preserves d9 
according to Proposition[10l the same holds for T. Therefore, T leaves ft E,s invariant as claimed. □ 

5.4 Product systems 

We next specialize some of the above facts to product systems. The notation here is independent of 
that of the rest of the paper. Let M = M x x M 2 . Let t 4 : N t := TM t -* M { and r : N := TM M be 
the tangent bundle maps and let 7Tj be the projection M -*■ Mi. The induced projection N -*■ iVj will 
also be written 7^ , so it makes sense to write 7r.; o t = o m . If there is some possibility of confusion 
we may write, for example, (qi,Vi) = (qi, (dir^qv) instead of Vi = TTi(v) for a given v in N. Either 
way, the product Riemannian metric reads 

(V,w) q = (v 1 ,W 1 ) qi + (v 2 ,W 2 )q 2 . 

Vertical and horizontal lifts, and the corresponding subbundles of TN decompose as expected in 
terms of the respective notions on N{. In particular, the Sasaki metric is similarly decomposed as 
(•,•} = 7rJ"(-,-)i + Tr^i'i ')2- The canonical contact form on N = Ni x N 2 becomes 9 = + tt 2 9 2 , 
where 9i is the contact form on Ni, and the invariant volume form = 7rJ"J7i a n 2 fl 2 . Whenever 
convenient, we omit explicit reference to the projection maps and write, for example, fi = Oi a ^2 
or 9 = 9x+9 2 . 

Assuming that the potential function U on M has the form U = U\ o 7Ti + U 2 ° tt 2 , where Ui is 
a smooth function on M,, the energy function becomes E = E\ o m + E 2 o tt 2 and the Hamiltonian 
vector field on N is written as X E = X\+ X 2 . where Xi is characterized by being ^-related to the 
Hamiltonian vector field on iVj associated to Ei and Tr^-related to for j + i. The (Sasaki) gradient 
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of E will be written, with slight abuse of notation, as grad E = grad E\ + grad E\ and the vector 
field T] ■■= (grad E)/\\gra.d E\\ 2 becomes 

|| grad 2 || grad E 2 \\ 2 



' || grad -Bill 2 + || grad E 2 \\ 2 ||grad E^ 2 + ||grad E 2 \\ 2 

Proposition 13. Let fl E := r\ J Q be the invariant volume form on the energy level N(£.) and 
similarly define Ctf* := rji — 1 Cti on level sets Ni(Ei). Then the level sets N(E) can be measurably 
partitioned as a disjoint union of product manifolds 

2V(£) = L|iVi(£i)xi\r 2 (£-£i) 

where the elements of the partition are the level sets of E\ : iV(£) -> 1R, and the invariant volume 
il E has the decomposition 

fl E = dE 1 a fif 1 a f2f 2 

adapted to this partition. 

Proof. The main point is to verify the stated form of Sl E . Define by r\ = ai?7i + a 2 r/ 2 . Now, f2 B 
can be written as 

(ariJ7i + CK2T/2) -l (fii a0 2 ) = ai^f 1 a ft 2 + a 2 r2i a Of 2 = ai^f 1 a dE 2 a fif 2 + a 2 dEi a Of 1 Afif 2 . 
Since <iE 2 = -<iEi on N(V), VL E 1 is an odd-degree form, and ai + a 2 = 1, 

ft B = (- ai dE 2 + a 2 dEi) a fif 1 a ftf 2 = dEi a fif 1 a Of 2 
as claimed. □ 

The Gibbs canonical distribution on Ni with temperature parameter f3i is the probability measure 
on Ni defined by the form 

d ■= — T^rSlf* *dEi, 

where Zi(fii) is a normalization constant. The following trivial but key observation must be noted. 

Corollary 2. If the states of the two subsystems are distributed according to the Gibbs canonical 
distribution with same parameter f3, then the state of the product system is also distributed according 
to the Gibbs canonical distribution with parameter (5. 

Proof. Let E = E 1 + E 2 and define Z(f3) = Z 1 (f3)Z 2 ((3). Note that dE 1 a dE 2 = dE 1 a dE. Due to 
Proposition Q21 

Ci a C2 = ± e , I ] . ^f 1 A n E 2 A dE 1 A dE 2 = ±^—Q E A dE. 

The measure obtained from £1 a £ 2 is already normalized, so Z(f3) is the correct denominator. □ 

Theorem Q] can now be seen to follow from Corollary [2] and Proposition Q] If the state of the 
wall system has the Gibbs distribution with parameter j3 and the state of the molecule system 
is given, prior to entering the interaction zone, the Gibbs distribution with the same parameter, 
then the state of the joint (product) system has a probability distribution which is invariant under 
the deterministic return map to the non-interaction zone. Thus the molecule factor of the state 
distribution of the total system upon return to the non-interaction zone remains the same. 
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5.5 Frame description of the volume forms 

Let m be the dimension of M and [/cMan open subset on which is defined a smooth orthonormal 
frame of vector fields {ei, . . . , e m }. Let Njj be the subset of elements in N with base point in U. 
Define 

:= {(q,u,E.) e [/xR m xR: || u || = 1 and U(q) < £}. 

Thus 'Nu is an open submanifold of U x S™ 1 ^ 1 x R. Let N[/(£) denote the submanifold mapping to 
£ under 7r3 : Njj -> R. Let {c\, . . . ,c m } represent the standard basis of R m and u ■ v the ordinary 
inner product. Observe that 



as 



it + sc,- 



=o || u + sa\\ 



is tangent to S™ 1 ^ 1 at a unit vector u, and {c^ , . . . , c^ l _ 1 } is a basis of T u 5"" _1 for all it not perpen- 
dicular to c m . For these u, let {</?i, . . . , <p m -i} be the dual basis associated to {c\, . . . , cj^,_ 1 }. In 
terms of this dual basis, the (standard) Riemannian volume form on S m ~ l is 

(5.5) <— = (-l)™- 1 W -c m ^A...^ n _ 1 . 

This is obtained from w„ phoro = u J (c^ a • • • a c* n ) , where the c* constitute the dual standard basis, by 
evaluating this form on the vectors c*(u). The Riemannian volume form on M (up to sign) is 

io M := el a ••• a e* m , 

where the e* form the dual frame on T*M . By identifying e, with (ej, 0, 0) and with (0, c* , 0), we 
may think of and as tangent to N[/(£), and {e l7 . . . , e* n ,ipi, . . . , as a frame of 1-forms 
on yfu(&). 

We now introduce a diffcomorphism F : -*■ Njj by 

(5.6) F(g jU) £) = (g,/i e ( ) 



{q,h E (q)Y l u. l e t (q)j, 



where /i £ (q) := y / 2(£ - U(q)). The inverse map is F 1 (g,?;) = (g,w,£), where = [|^[L 1 (u,ei(g)) g 
and£ = ±H| 2 + £/(g). 

Proposition 14. For any given v,w e iV 9 , define vectors and iy* in T V N by 

< ■= w v u - \\v\\- 2 (v,w) q W v and w A := tyf - H|- 2 dl7(ii;) W„. 

// {wi, . . . , w m } is a basis of T q M and v e N(E) for some £ e R ; then {w A , . . . , w^, w\ , . . . , 
is a basis for T V N(8,), providing decompositions 

TN(Z) = T Y N © T A N and TN = T v N © T A N © Rt?, 

where T v N and T A N are spanned by vectors of the form w v and w A , respectively. Define the forms 
uiij(w) := (VwZj, Letting (q,v) = F(q,u,E), then 

m 

dF(q,u,£)Ci = h E (q)eJ(v) and dF^^a = e A (v) + £ w sr (ei)M r e«(«)- 

r,s=l 

TTie vector field rj transforms under F according to 

! _/ groa^/ ( \\v\\ q 1 'Zj{v,e j ) q uj ji {grad q U) \ \ 

" ^UkP + IWl HI 2 + IWI 2 // /■ 
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Proof. We only obtain dF v w and dF v w to illustrate the method of calculation. First, dF v w 
equals 



d_ 

da 

where ^ = 



s=0 



F 1 (u + sid) 



/ / (z; + sw,ej) q \ 1 
=o\ \ lk + sw|| g j'2 



\v + swf q + U(q)) = (0,(£ i ),(v,w) q ), 



(v, w)v/\\v\\ 2 , ei) = \\v\\ 1 (K v w y ,e,;). Before calculating dF v 1 w Ii , first note that 



r/.s 



s=0 



where 7(s) is a differentiable curve such that 7(0) = q and 7'(0) = w, and y 7 ( s )W indicates the parallel 
translation of v along 7. Keeping in mind that ||I' 7 ( s )i'|| = \\v\\ and that ^| s _ (^(s)^, e,) = (v,V w ei), 
we obtain 



r/.s 



s=0 



ris 



The claimed identities are easily obtained from these. 



□ 



We have so far made no special assumptions about the local orthonormal frame {e\, . . . ,e m }. 
Since we may want to consider the invariant volume form D. E near boundary points of M, it 
makes sense to introduce the following concept: The orthonormal frame is said to be adapted to 
a codimension-1 foliation § of U c M if {ei, . . . , e m -i} spans the tangent space to each leaf 5* of S at 
any given point q e U. Recall that the set of elements in N (respectively, in N(E)) with base point 



in S is denoted by Ns (respectively, iVs(£)). The set {e^. 



i e m-i} * s eas ily seen to 



be a frame on Ns(£-). It was noted before that {e^ , • ■ • , e^, e\ , . . . , e^ t _ 1 } is a local, not necessarily 
orthonormal, frame on -/V(£), and it can be shown exactly as in Proposition [5] that for a tangent 
vector to N(E) to actually be tangent to Ns(£) it is necessary and sufficient that its projection be 
tangent to S. In particular, if {e%, . . . , e m } is an adapted frame, the distribution in TN(H) spanned 
by { e i,---, e m-i, e iV--7 e m-i} is hwolutive. 

Proposition 15. Define on Njj \ {zero section} the functions 

ipi(v) := {v,ei) q and ip (v) := (\\v\\ 2 + \\grad U\\ 2 ) 1 dll q (v), 
,e m } is an orthonormal frame on U. The Hamiltonian vector field X E has the form 

dU(e m ) 



where {ex, ■ 
(5.7) 



m rn m m—1 / 

X E = E M - E dUieiX = E - E d U(e l ) 

1=1 i=l 2=1 2 = 1 \ 



i=l i=l 

T/ie contact form 9 restricted to Njj can be written as 



-Ipi 



(5.8) 



where {rj* , e^, . . . , e^, e^, . . . , e^ t _ 1 } is £/ie c/wa/ basis 0/(77, e^, . 
ft B on each N(E) over the set U n M can be written as 



i °m> °1 ' ■ 



> e m-i}- The volume form 



(5.9) 
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Now suppose that {e\, . . . ,e m } is adapted to a local codimension-l foliation S and let 

(5.10) 40 = + •■ -+^-1^-1 

be the restriction of 9 to the leaves Ng{£)- Then Lgd9 is a symplectic form on each Ns(£.), and it 
can be written as 



i,j=l \ n t I i,j=l n E 

It follows that 



dU(ej)tp j - dU(ei)ifh j „ 
hi 



(5.12) lUdBr- 1 = ±f|e? a a a el a ... a fi B = ±mf^ a - a a ^ a - a e^. 

The volume form f2 s transforms under the diffeomorphism F : Nj/(£) -»■ AT[/(£) defined by 15.61 
according to 

F*f~*E , _ V.TCI— 2 * M . _* sphere 

T7ie symplectic form on a hypersurface S in U is expressed under F according to 

F^de)™- 1 = ± (^-^) hf-^c 



~1lu s a 7r£w' : 



Proof. All of this follows straightforwardly from the definitions and basic facts. We only make a few 
comments. Identity 15.91 results by noting that a ( ( X e j fl E ) is a (2m - l)-form 011 N(£.), thus it 
can be written as f£l E , where the function / is found by applying the interior multiplication with 
X E and using that -0 m is the coefficient of X E for the basis element e^„. Item 2 of Proposition [Til is 
also needed. Identity l5.11l can be derived with little effort by using the identity l5.1[ which expresses 
the symplectic form d9 in terms of the Sasaki metric. Identity 15.121 is a consequence of 15.111 and 
the identity det(/ + ab l ) = 1 + b l a, where / is the identity matrix, a, b are column vectors, and b l 
is the row vector associated to b after transpose. It should be kept in mind that the ej span an 
(m- 1) -dimensional subspace at each point, so they are linearly dependent. In fact, they satisfy the 
equation YJiLi ipi^l = 0. □ 

Theorem |4] is a corollary of the proposition. 
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